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Input. Set : N:\paola\09982667.txt 
Output Set: N:\CRF3\05142002\I982667.raw 

SEQUENCE LISTING 

GENERAL INFORMATION: 
(i) APPLICANT: Prudent, James R. 

Hall, Jeff G. 
Lyamichev, Victor I. 
(ii) TITLE OF INVENTION: Invasive Cleavge Of Nucleic Acids 
(iii) NUMBER OF SEQUENCES: 69 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen & Carroll, LLP 

(B) STREET: 220 Montgomery Street, Suite 2200 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: United States Of America 

(F) ZIP: 94104 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
OB) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/982,667 

(B) FILING DATE: 18 -Oct- 2001 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/756,386 

(B) FILING DATE 

(A) APPLICATION 

(B) FILING DATE 

(A) APPLICATION 

(B) FILING DATE 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Ingolia, Diane E. 

(B) REGISTRATION NUMBER: 40,027 

(C) REFERENCE/DOCKET NUMBER: FORS- 02564 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: £415) 397-8338 
INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



NUMBER: US 08/682,853 

12-JUL-1996 
NUMBER: US 08/599,491 
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RAW SEQUENCE LISTING DATE: 05/14/2002 

PATENT APPLICATION: US/09/982 r 667 TIME: 15:01:09 

Input Set : N:\paola\09982667.txt 
Output Set: N:\CRF3\05142002\l982667.raw 

64 (ii) MOLECULE TYPE: DNA (genomic) 

68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

70 ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 60 

72 CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 120 

74 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTCAAGGA GGACGGGGAC 180 

76 GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC CTACGGGGGG 240 

78 TACAAGGCGG GCCGGGCCCC CACGCCGGAG GACTTTCCCC GGCAACTCGC CCTCATCAAG 300 

80 GAGCTGGTGG ACCTCCTGGG GCTGGCGCGC CTCGAGGTCC CGGGCTACGA GGCGGACGAC 360 

82 GTCCTGGCCA GCCTGGCCAA GAAGGCGGAA AAGGAGGGCT ACGAGGTCCG CATCCTCACC 420 

84 GCCGACAAAG ACCTTTACCA GCTCCTTTCC GACCGCATCC ACGTCCTCCA CCCCGAGGGG 4 80 

86 TACCTCATCA CCCCGGCCTG GCTTTGGGAA AAGTACGGCC TGAGGCCCGA CCAGTGGGCC 540 

88 GACTACCGGG CCCTGACCGG GGACGAGTCC GACAACCTTC CCGGGGTCAA GGGCATCGGG 600 

GAGAAGACGG CGAGGAAGCT TCTGGAGGAG TGGGGGAGCC TGGAAGCCCT CCTCAAGAAC 660 

CTGGACCGGC TGAAGCCCGC CATCCGGGAG AAGATCCTGG CCCACATGGA CGATCTGAAG 720 

94 CTCTCCTGGG ACCTGGCCAA GGTGCGCACC GACCTGCCCC TGGAGGTGGA CTTCGCCAAA 780 

96 AGGCGGGAGC CCGACCGGGA GAGGCTTAGG GCCTTTCTGG AGAGGCTTGA GTTTGGCAGC 840 

98 CTCCTCCACG AGTTCGGCCT TCTGGAAAGC CCCAAGGCCC TGGAGGAGGC CCCCTGGCCC 900 

100 CCGCCGGAAG GGGCCTTCGT GGGCTTTGTG CTTTCCCGCA AGGAGCCCAT GTGGGCCGAT 960 

102 CTTCTGGCCC TGGCCGCCGC CAGGGGGGGC CGGGTCCACC GGGCCCCCGA GCCTTATAAA 1020 

104 GCCCTCAGGG ACCTGAAGGA GGCGCGGGGG CTTCTCGCCA AAGACCTGAG CGTTCTGGCC 1080 

106 CTGAGGGAAG GCCTTGGCCT CCCGCCCGGC GACGACCCCA TGCTCCTCGC CTACCTCCTG 1140 

108 GACCCTTCCA ACACCACCCC CGAGGGGGTG GCCCGGCGCT ACGGCGGGGA GTGGACGGAG • 1200 

110 GAGGCGGGGG AGCGGGCCGC CCTTTCCGAG AGGCTCTTCG CCAACCTGTG GGGGAGGCTT 1260 

112 GAGGGGGAGG AGAGGCTCCT TTGGCTTTAC CGGGAGGTGG AGAGGCCCCT TTCCGCTGTC 1320 

114 CTGGCCCACA TGGAGGCCAC GGGGGTGCGC CTGGACGTGG CCTATCTCAG GGCCTTGTCC 1380 

116 CTGGAGGTGG CCGAGGAGAT CGCCCGCCTC GAGGCCGAGG TCTTCCGCCT GGCCGGCCAC 1440 

118 CCCTTCAACC TCAACTCCCG GGACCAGCTG GAAAGGGTCC TCTTTGACGA GCTAGGGCTT 1500 

120 CCCGCCATCG GCAAGACGGA GAAGACCGGC AAGCGCTCCA CCAGCGCCGC CGTCCTGGAG 1560 

122 GCCCTCCGCG AGGCCCACCC CATCGTGGAG AAGATCCTGC AGTACCGGGA GCTCACCAAG 1620 

124 CTGAAGAGCA CCTACATTGA CCCCTTGCCG GACCTCATCC ACCCCAGGAC GGGCCGCCTC 1680 

126 CACACCCGCT TCAACCAGAC GGCCACGGCC ACGGGCAGGC TAAGTAGCTC CGATCCCAAC 1740 

128 CTCCAGAACA TCCCCGTCCG CACCCCGCTT GGGCAGAGGA TCCGCCGGGC CTTCATCGCC 1800 

130 GAGGAGGGGT GGCTATTGGT GGCCCTGGAC TATAGCCAGA TAGAGCTCAG GGTGCTGGCC 1860 

132 CACCTCTCCG GCGACGAGAA CCTGATCCGG GTCTTCCAGG AGGGGCGGGA CATCCACACG 1920 

134 GAGACCGCCA GCTGGATGTT CGGCGTCCCC CGGGAGGCCG TGGACCCCCT GATGCGCCGG 1980 

136 GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG CCTCTCCCAG 2040 

138 GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT TCAGAGCTTC 2100 

140 CCCAAGGTGC GGGCCTGGAT TGAGAAGACC CTGGAGGAGG GCAGGAGGCG GGGGTACGTG 2160 

142 GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG AGGCCCGGGT GAAGAGCGTG 2220 

144 CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC AGGGCACCGC CGCCGACCTC 2280 

146 ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG AAATGGGGGC CAGGATGCTC 2340 

148 CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG AGAGGGCGGA GGCCGTGGCC 2400 

150 CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG TATCCCCTGG CCGTGCCCCT GGAGGTGGAG 2460 

152 GTGGGGATAG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT ACCACC 2506 
154 (2) INFORMATION FOR SEQ ID NO: 2: 

156 (i) SEQUENCE CHARACTERISTICS: 

157 (A) LENGTH: 2496 base pairs 

158 (B) TYPE: nucleic acid 

159 (C) STRANDEDNESS : double 
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160 
162 
166 


(D) TOPOLOGY: 
(ii) MOLECULE TYPE: 
(xi) SEQUENCE DESCRI 


linear 

DNA (genomic) 
PTION: SEQ ID NO : 2: 


168 ATGGCGATGC TTCCCCTCTT TGAGCCCAAA GGCCGCGTGC TCCTGGTGGA CGGCCACCAC 

170 CTGGCCTACC GCACCTTCTT TGCCCTCAAG GGCCTCACCA CCAGCCGCGG CGAACCCGTT 

172 CAGGCGGTCT ACGGCTTCGC CAAAAGCCTC CTCAAGGCCC TGAAGGAGGA CGGGGACGTG 

174 GTGGTGGTGG TCTTTGACGC CAAGGCCCCC TCCTTCCGCC ACGAGGCCTA CGAGGCCTAC 

176 AAGGCGGGCC GGGCCCCCAC CCCGGAGGAC TTTCCCCGGC AGCTGGCCCT CATCAAGGAG 

178 TTGGTGGACC TCCTAGGCCT TGTGCGGCTG GAGGTTCCCG GCTTTGAGGC GGACGACGTG 

180 CTGGCCACCC TGGCCAAGCG GGCGGAAAAG GAGGGGTACG AGGTGCGCAT CCTCACTGCC 

182 GACCGCGACC TCTACCAGCT CCTTTCGGAG CGCATCGCCA TCCTCCACCC TGAGGGGTAC 

184 CTGATCACCC CGGCGTGGCT TTACGAGAAG TACGGCCTGC GCCCGGAGCA GTGGGTGGAC 

186 TACCGGGCCC TGGCGGGGGA CCCCTCGGAT AACATCCCCG GGGTGAAGGG CATCGGGGAG 

188 AAGACCGCCC AGAGGCTCAT CCGCGAGTGG GGGAGCCTGG AAAACCTCTT CCAGCACCTG 

190 GACCAGGTGA AGCCCTCCTT GCGGGAGAAG CTCCAGGCGG GCATGGAGGC CCTGGCCCTT 

192 TCCCGGAAGC TTTCCCAGGT GCACACTGAC CTGCCCCTGG AGGTGGACTT CGGGAGGCGC 

194 CGCACACCCA ACCTGGAGGG TCTGCGGGCT TTTTTGGAGC GGTTGGAGTT TGGAAGCCTC 

196 CTCCACGAGT TCGGCCTCCT GGAGGGGCCG AAGGCGGCAG AGGAGGCCCC CTGGCCCCCT 

198 CCGGAAGGGG CTTTTTTGGG CTTTTCCTTT TCCCGTCCCG AGCCCATGTG GGCCGAGCTT 

200 CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG CACAAGACCC CCTTAGGGGC 

202 CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG ACCTGGCGGT TTTGGCCCTG 

204 CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC TCCTGGCCTA CCTTCTGGAC 

206 CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG GACGGAGGAT 

208 GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA GCGCCTTAAG 

210 GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC CCGGGTGTTG 

212 GCCCGGATGG AGGCCACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC CCTCTCCCTG 

214 GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC CGGCCACCCC 

216 TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT GGGCCTGCCT 

218 GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT GCTGGAGGCC 

220 CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT ACCGGGAGCT CACCAAGCTC 

222 AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG CCGGCTCCAC. 

224 ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA CCCCAACCTG 

226 CAGAACATCC CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT CGTGGCCGAG 

228 GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT CCTGGCCCAC 

230 CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT CCACACCCAG 

232 ACCGCCAGCT GGATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT GCGCCGGGCG 

234 GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT CTCCGGGGAG 

236 CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA GAGCTACCCC 

238 AAGGTGCGGG CCTGGATTGA GGGGACCCTC GAGGAGGGCC GCCGGCGGGG GTATGTGGAG 

240 ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA GAGCGTGCGC 

242 GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC CGACCTCATG 

244 AAGCTGGCCA TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG GATGCTTTTG 

246 CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG GGTAGCCGCT 

248 TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA GGTGGAGGTG 

250 GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAGTAG 
252 (2) INFORMATION FOR SEQ ID NO: 3: 

254 (i) SEQUENCE CHARACTERISTICS: 

255 (A) LENGTH: 2504 base pairs 

256 (B) TYPE: nucleic acid 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2496 


file://C:\CRF3\Outhold\VsrI982667.htm 


5/14/02 


Page 4 of 8 


RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/982,667 


DATE: 05/14/2002 
TIME: 15:01:09 


Input Set : N:\paola\09982667.txt 
Output Set: N:\CRF3\05142002\l982667.raw 


257 (C) STRANDEDNESS: double 

258 (D) TOPOLOGY: linear 

260 (ii) MOLECULE TYPE: DNA (genomic) 

264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

266 ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG 
268 CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA 
270 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG 
272 AAGGCCGTCT TCGTGGTCTT TGACGCCAAG GCCCCCTCCT 
274 GCCTACAAGG CGGGGAGGGC CCCGACCCCC GAGGACTTCC 
276 AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG 
278 GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG 
280 ACCGCCGACC GCGACCTCTA CCAACTCGTC TCCGACCGCG 
282 GGCCACCTCA TCACCCCGGA GTGGCTTTGG GAGAAGTACG 
284 GTGGACTTCC GCGCCCTCGT GGGGGACCCC TCCGACAACC 
286 GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA 
288 AACCTGGACC GGGTAAAGCC AGAAAACGTC CGGGAGAAGA 
290 CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG CGCACCGACC 
292 GCCCAGGGGC GGGAGCCCGA CCGGGAGGGG CTTAGGGCCT 
294 GGCAGCCTCC TCCACGAGTT CGGCCTCCTG GAGGCCCCCG 
296 TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT 
298 GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG 
300 TTGGCGGGGC TAAAGGACCT CAAGGAGGTC CGGGGCCTCC 
302 TTGGCCTCGA GGGAGGGGCT AGACCTCGTG CCCGGGGACG 
304 CTCCTGGACC CCTCCAACAC CACCCCCGAG GGGGTGGCGC 
306 ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC 
308 CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG 
310 CGGGTCCTGG CCCACATGGA GGCCACCGGG GTACGGCTGG 
312 CTTTCCCTGG AGCTTGCGGA GGAGATCCGC CGCCTCGAGG 
314 GGCCACCCCT TCAACCTCAA CTCCCGGGAC CAGCTGGAAA 
316 AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC 
318 CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA 
320 ACCAAGCTCA AGAACACCTA CGTGGACCCC CTCCCAAGCC 
322 CGCCTCCACA CCCGCTTCAA CCAGACGGCC ACGGCCACGG 
324 CCCAACCTGC AGAACATCCC CGTCCGCACC CCCTTGGGCC 
326 GTGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA 
328 CTCGCCCACC TCTCCGGGGA CGAAAACCTG ATCAGGGTCT 
330 CACACCCAGA CCGCAAGCTG GATGTTCGGC GTCCCCCCGG 
332 CGCCGGGCGG CCAAGACGGT GAACTTCGGC GTCCTCTACG 
334 TCCCAGGAGC TTGCCATCCC CTACGAGGAG GCGGTGGCCT 
336 GCTTCCCCAA GGTGCGGGCC TGGATAGAAA AGACCCTGGA 
338 ACGTGGAAAC CCTCTTCGGA AGAAGGCGCT ACGTGCCCGA 
340 GCGTCAGGGA GGCCGCGGAG CGCATGGCCT TCAACATGCC 
342 ACCTCATGAA GCTCGCCATG GTGAAGCTCT TCCCCCGCCT 
344 TGCTCCTCCA GGTCCACGAC GAGCTCCTCC TGGAGGCCCC 
346 TGGCGGCTTT GGCCAAGGAG GCCATGGAGA AGGCCTATCC 
348 TGGAGGTGGG GATGGGGGAG GACTGGCTTT CCGCCAAGGG 
350 (2) INFORMATION FOR SEQ ID NO : 4: 

352 (i) SEQUENCE CHARACTERISTICS: 

353 (A) LENGTH: 832 amino acids 


TCCTCCTGGT 

GGACGGCCAC 

60 

CCACGAGCCG 

GGGCGAACCG 

120 

CCCTGAAGGA 

GGACGGGTAC 

180 

TCCGCCACGA 

GGCCTACGAG 

240 

CCCGGCAGCT 

CGCCCTCATC 

300 

TCCCCGGCTA 

CGAGGCGGAC 

360 

GGTACGAGGT 

GCGCATCCTC 

420 

TCGCCGTCCT 

CCACCCCGAG 

480 

GCCTCAGGCC 

GGAGCAGTGG 

540 

TCCCCGGGGT 

CAAGGGCATC 

600 

GCCTGGAAAA 

CCTCCTCAAG 

660 

TCAAGGCCCA 

CCTGGAAGAC 

720 

TCCCCCTGGA 

GGTGGACCTC 

780 

TCCTGGAGAG 

GCTGGAGTTC 

840 

CCCCCCTGGA 

GGAGGCCCCC 

900 

CCCGCCCCGA 

GCCCATGTGG 

960 

TGCACCGGGC 

AGCAGACCCC 

1020 

TCGCCAAGGA 

CCTCGCCGTC 

1080 

ACCCCATGCT 

CCTCGCCTAC 

1140 

GGCGCTACGG 

GGGGGAGTGG 

1200 

TCCATCGGAA 

CCTCCTTAAG 

1260 

AGGTGGAAAA 

GCCCCTCTCC 

1320 

ACGTGGCCTA 

CCTTCAGGCC 

1380 

AGGAGGTCTT 

CCGCTTGGCG 

1440 

GGGTGCTCTT 

TGACGAGCTT 

1500 

GCTCCACCAG 

CGCCGCGGTG 

1560 

TCCTCCAGCA 

CCGGGAGCTC 

1620 

TCGTCCACCC 

GAGGACGGGC 

1680 

GGAGGCTTAG 

TAGCTCCGAC 

1740 

AGAGGATCCG 

CCGGGCCTTC 

1800 

GCCAGATAGA 

GCTCCGCGTC 

1860 

TCCAGGAGGG 

GAAGGACATC 

1920 

AGGCCGTGGA 

CCCCCTGATG 

1980 

GCATGTCCGC 

CCATAGGCTC 

2040 

TTATAGAGGC 

TACTTCCAAA 

2100 

GGAGGGGAGG 

AAGCGGGGCT 

2160 

CCTCAACGCC 

CGGGTGAAGA 

2220 

CGTCCAGGGC 

ACCGCCGCCG 

2280 

CCGGGAGATG 

GGGGCCCGCA 

2340 

CCAAGCGCGG 

GCCGAGGAGG 

2400 

CCTCGCCGTG 

CCCCTGGAGG 

2460 

TTAG 


2504 
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354 


(B) TYPE: amino acid 










355 


(C) STRANDEDNESS : single 









356 


(D) TOPOLOGY: linear 










358 

(ii) 

MOLECULE TYPE: protein 










362 

(xi) 

SEQUENCE DESCRIPTION: SEQ ID NO 

: 4: 







364 

Met 

Arg 

Gly 

Met 

Leu 

Pro 

Leu 

Phe 

Glu 

Pro 

Lys 

Gly 

Arg 

Val 

Leu 

Leu 

365 

1 




5 





10 





15 


367 

Val 

Asp 

Gly 

His 

His 

Leu 

Ala 

Tyr 

Arg 

Thr 

Phe 

His 

Ala 

Leu 

Lys 

Gly 

368 




20 





25 





30 



370 

Leu 

Thr 

Thr 

Ser 

Arg Gly Glu 

Pro 

Val 

Gin 

Ala 

Val 

Tyr 

Gly 

Phe 

Ala 

371 



35 





40 





45 




373 

Lys 

Ser 

Leu 

Leu 

Lys 

Ala 

Leu 

Lys 

Glu 

Asp 

Gly 

ASp 

Ala 

Val 

He 

Val 

374 


50 





55 





60 





376 

Val 

Phe 

Asp 

Ala 

Lys 

Ala 

Pro 

Ser 

Phe 

Arg 

His 

Glu 

Ala 

Tyr 

Gly 

Gly 

377 

65 





70 





75 





80 

379 

Tyr 

Lys 

Ala 

Gly 

Arg 

Ala 

Pro 

Thr 

Pro 

Glu 

Asp 

Phe 

Pro 

Arg 

Gin 

Leu 

380 





85 





90 





95 


382 

Ala 

Leu 

He 

Lys 

Glu 

Leu 

Val 

Asp 

Leu 

Leu 

Gly 

Leu 

Ala 

Arg 

Leu 

Glu 

383 




100 





105 





110 



385 

Val 

Pro 

Gly 

Tyr 

Glu 

Ala 

Asp 

ASp 

Val 

Leu 

Ala 

Ser 

Leu 

Ala 

Lys 

Lys 

386 



115 





120 





125 




388 

Ala 

Glu 

Lys 

Glu 

Gly 

Tyr 

Glu 

Val 

Arg 

He 

Leu 

Thr 

Ala 

Asp 

Lys 

Asp 

389 


130 





135 





140 





391 

Leu 

Tyr 

Gin 

Leu 

Leu 

Ser 

Asp 

Arg 

He 

His 

Val 

Leu 

His 

Pro 

Glu 

Gly 

392 

145 





150 





155 





160 

394 

Tyr 

Leu 

He 

Thr 

Pro 

Ala 

Trp 

Leu 

Trp 

Glu 

Lys 

Tyr 

Gly 

Leu 

Arg 

Pro 

395 





165 





170 





175 


397 

Asp 

Gin 

Trp 

Ala 

ASp 

Tyr 

Arg 

Ala 

Leu 

Thr 

Gly 

Asp 

Glu 

Ser 

Asp 

Asn 

398 




180 





185 





190 



400 

Leu 

Pro 

Gly 

Val 

Lys 

Gly 

He 

Gly 

Glu 

Lys 

Thr 

Ala 

Arg 

Lys 

Leu 

Leu 

401 



195 





200 





205 




403 

Glu 

Glu 

Trp 

Gly 

Ser 

Leu 

Glu 

Ala 

Leu 

Leu 

Lys 

Asn 

Leu 

Asp 

Arg 

Leu 

404 


210 





215 





220 





406 

Lys 

Pro 

Ala 

He 

Arg 

Glu 

Lys 

He 

Leu 

Ala 

His 

Met 

Asp 

Asp 

Leu 

Lys 

407 

225 





230 





235 





240 

409 

Leu 

Ser 

Trp 

Asp 

Leu 

Ala 

Lys 

Val 

Arg 

Thr 

Asp 

Leu 

Pro 

Leu 

Glu 

Val 

410 





245 





250 





255 


412 

Asp 

Phe 

Ala 

Lys 

Arg 

Arg 

Glu 

Pro 

Asp 

Arg 

Glu 

Arg 

Leu 

Arg 

Ala 

Phe 

413 




260 





265 





270 



415 

Leu 

Glu 

Arg 

Leu 

Glu 

Phe 

Gly 

Ser 

Leu 

Leu 

His 

Glu 

Phe 

Gly 

Leu 

Leu 

416 



275 





280 





285 




418 

Glu 

Ser 

Pro 

Lys 

Ala 

Leu 

Glu 

Glu 

Ala 

Pro 

Trp 

Pro 

Pro 

Pro 

Glu 

Gly 

419 


290 





295 





300 





421 

Ala 

Phe 

val 

Gly 

Phe 

Val 

Leu 

Ser 

Arg 

Lys 

Glu 

Pro 

Met 

Trp 

Ala 

Asp 

422 

305 





310 





315 





320 

424 

Leu 

Leu 

Ala 

Leu 

Ala 

Ala 

Ala 

Arg 

Gly 

Gly 

Arg 

Val 

His 

Arg 

Ala 

Pro 

425 





325 





330 





335 


427 

Glu 

Pro 

Tyr 

Lys 

Ala 

Leu 

Arg 

Asp 

Leu 

Lys 

Glu 

Ala 

Arg 

Gly 

Leu 

Leu 

428 




340 





345 





350 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:7; N Pos . 4,5,181,182,190,366,617,628,685,714,722,738,784,1022,1029 
Seq#:7; N Pos. 1038,1053,1098,1105,1206,1227,1244,1251,1252,1253,1350,1380 
Seq#:7; N Pos. 1497,1530,1569,1572,1641,1653,1655,1770,1812,2319,2346,2396 
Seq#:8; Xaa Pos . 2 , 63 , 109 , 186 , 205 , 209 , 227 , 228 , 233 , 240 , 243 , 244 , 247 , 260 , 290 
Seq#:8; Xaa Pos . 329 , 336 , 340 , 368 , 414 , 417 , 418 , 431 , 551, 605 , 773 , 794 , 798 , 823 , 833 
Seq#:57; N Pos. 18 
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VERIFICATION SUMMARY DATE: 05/14/2002 

PATENT APPLICATION: US/09/982,667 TIME: 15:01:10 

Input Set : N:\paola\09982667.txt 
Output Set: N:\CRF3\05142002\l982667.raw 

L:29 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:30 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:976 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 8 after pos . : 0 


L 

985 M:341 W: 

(46) 

"n" 

or * 

Xaa" 

used, 

for SEQ ID#: 

3 after pos . : 48 

L 

994 M:341 W: 

(46) 

"n" 

or ' 

Xaa" 

used, 

for SEQ ID#: 

3 after pos . : 96 

L 

1009 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

176 

L 

1012 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

192 

L 

1015 

M 

341 

W 

(46) 

-n- 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

208 

L 

1018 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

224 

L 

1021 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

240 

L 

1024 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

256 

L 

1030 

M:341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

288 

L 

1036 

M 

341 

W 

(46) 

"D" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

320 

L 

1039 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

336 

*L 

1042 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

IDS 

:8 

after 

pos . 

352 

L 

1051 

M 

341 

W 

(46) 

"D" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

400 

L 

1054 

M 

341 

W 

(46) 

"D" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

416 

L 

1078 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

544 

L 

1087 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

592 

L 

1120 

M 

341 

W 

(46) 


or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

768 

L 

1123 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

784 

L 

1129 

M:341 

W 

(46) 


or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

816 

L 

1132 

M 

341 

W 

(46) 

"n" 

or 

"Xaa" 

used, 

for 

SEQ 

ID# 

:8 

after 

pos . 

832 

L 

2271 

M 

246 

W 

Invalid 

value of 

Alpha 

Sequence 

Header Field, [TOPOLOGY 
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